ABSTRACT

PROBLEM TO BE SOLVED: To provide a systematized knowledge analyzing device 
and classifying device for extracting a prescribed terminal class suited to 
the content of an unclassified document, and relating them even when the 
state of structured data is not sufficiently known. 

SOLUTION: A systematized knowledge analyzing device 10 obtains existing
structured data and the document information related to it. A knowledge
system dictionary preparing part 12 performs keyword extraction on the
documents belonging to the same terminal class of the structured data and
generates a feature vector consisting of significant words and their
weights. The feature vector is stored, together with the obtained
information, as the feature of that terminal class in a classifying system
knowledge dictionary 13. When an unclassified document is obtained, an
information abstracting part 22 of an automatic classifying device 20
performs keyword extraction and generates a feature vector for it. An
information classifying part 23 matches this vector against the previously
registered feature vector of each terminal class and allocates the document
to the terminal class with a high matching level.
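
The classification flow in this abstract can be sketched roughly as follows. This is only an illustration: the abstract does not say how keywords are weighted or how vectors are matched, so relative word frequency and cosine similarity are assumptions, and the names `feature_vector`, `build_dictionary` and `classify` are hypothetical.

```python
import math
import re
from collections import Counter

def feature_vector(text, top_n=20):
    """Return {word: weight}; weight is relative frequency (an assumed weighting)."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if len(w) > 3)  # crude stand-in for keyword extraction
    total = sum(counts.values()) or 1
    return {w: c / total for w, c in counts.most_common(top_n)}

def cosine(u, v):
    """Cosine similarity between two sparse {word: weight} vectors."""
    dot = sum(weight * v.get(word, 0.0) for word, weight in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def build_dictionary(classified_docs):
    """{terminal class: [document text, ...]} -> {terminal class: feature vector}."""
    return {cls: feature_vector(" ".join(docs)) for cls, docs in classified_docs.items()}

def classify(document, dictionary):
    """Allocate the document to the terminal class with the highest matching level."""
    vec = feature_vector(document)
    return max(dictionary, key=lambda cls: cosine(vec, dictionary[cls]))

dictionary = build_dictionary({
    "databases": ["relational database query index", "database index storage engine"],
    "robotics": ["robot controller actuator", "robot sensor motion controller"],
})
print(classify("a query planner for a database index", dictionary))
```

A real system would presumably use a proper keyword extractor and a tuned threshold rather than always picking the best class.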
ABSTRACT

PROBLEM TO BE SOLVED: To automatically edit an abstract of high 
completeness by judging characteristics of a document to be edited and 
editing it in details matching with the characteristics. 

SOLUTION: A document characteristic judging part 1 judges characteristics
of a document inputted to an abstract editing device and an abstract
editing part 2 generates an abstract of the document containing
specific words and phrases according to the judged result. Then, a specific
word and phrase retrieval part 10 included in the judging part 1 judges
whether or not the inputted document includes a specific word or phrase
such as 'judicial decision'. Further, an abstract editing part 11 edits a
document which includes no specific word or phrase into an abstract, and
a re-editing process judging part 12 judges whether or not the edited
abstract needs to be re-edited by judging the distribution of
important sentences including important words of the abstract.
Consequently, the best abstract of the document can automatically be
edited by automatically recognizing characteristics of the document, and
the device is able to automatically edit abstracts of documents
having different characteristics by itself.
Keyword index updating method for data search applications, involves
receiving query from user and updating keyword index so that modified
keyword list includes user's query in keywords linked to searched
data object

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC)

Inventor: COLE A G; RAVIN Y; SACHAR H E

Number of Countries: 001 Number of Patents: 001

Patent Family:
Patent Details:
Abstract (Basic) : US 6571239 B1

NOVELTY - A data object corresponding to user's query is
retrieved from the repository by search engine in response to user's
query. The keyword index is updated to form a modified keyword list
which includes associated query in keywords linked to retrieved data
object.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following:

(1) data object search apparatus;

(2) article of manufacture comprising computer usable medium; and

(3) computer program product comprising computer usable medium
storing computer readable program code.

USE - For incrementally updating and modifying keyword index for
searching data object e.g. document, abstract, image pattern in
repository.

ADVANTAGE - Prevents mismatch between the keyword employed by user
and the manual or automatically assigned keyword, by updating keyword
based on user's query.

DESCRIPTION OF DRAWING(S) - The figure shows a flowchart explaining
keyword and data object association process.
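
As a rough illustration of the incremental update described in this record, the sketch below links the terms of a user's query to a data object once that object has been retrieved for the query. The class `KeywordIndex` and its methods are hypothetical names, and treating every query term as a new keyword is an assumption, not the patented procedure.

```python
from collections import defaultdict

class KeywordIndex:
    def __init__(self):
        self.keywords = defaultdict(set)  # object id -> keywords linked to that object

    def add(self, object_id, keywords):
        """Register the manually or automatically assigned keywords."""
        self.keywords[object_id].update(k.lower() for k in keywords)

    def search(self, query):
        """Return ids of objects sharing at least one keyword with the query."""
        terms = set(query.lower().split())
        return sorted(oid for oid, kws in self.keywords.items() if terms & kws)

    def record_selection(self, query, object_id):
        """Update the index: link the user's query terms to the retrieved object."""
        self.keywords[object_id].update(query.lower().split())

index = KeywordIndex()
index.add("doc1", ["automobile", "engine"])
hits = index.search("car engine")          # matches only through "engine"
index.record_selection("car engine", hits[0])
print(index.search("car"))                 # "car" is now linked to doc1
```

After the update, a later user who searches for "car" alone finds the same object, which is the keyword mismatch the ADVANTAGE paragraph refers to.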
Information retrieval system for report mining, using generated virtual
tables and database that define line patterns by type and establishing 
links that facilitate extraction 

Patent Assignee: SUNGARD EPROCESS INTELLIGENCE INC (SUNG-N) 
Number of Countries: 001 Number of Patents: 001
Abstract (Basic) : US 6185560 B1

NOVELTY - Automatically generates virtual tables defining line
patterns by type, based on location and frequency of occurrence in
report and establishes links among those definitions to facilitate data
extraction. Creates a virtual database of structural patterns and
generates virtual records from the virtual database in response to
user queries.

DETAILED DESCRIPTION - Permits fully automated abstraction of
patterns used in report based data streams. Enables abstracting of
patterns existing as complete text lines in a report. Stipulates that
each and every line of data in report must match a pattern, and
addresses exceptions by either creating new patterns or modifying an
existing pattern to include the exception.

Abstracts well defined collection of non-overlapping patterns
from report. Effectively disregards page breaks and non-significant
text blocks in defining patterns.
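
One way to picture "line patterns by type" is to collapse each report line into a character-class signature and count how often each signature occurs. The sketch below does only that; it ignores line location, which the abstract also uses, and the signature scheme and the names `line_signature` and `abstract_patterns` are assumptions made for illustration.

```python
import re
from collections import Counter

def line_signature(line):
    """Collapse a line into a type pattern: digit runs -> 9, letter runs -> A, space runs -> one space."""
    sig = re.sub(r"\d+", "9", line.strip())
    sig = re.sub(r"[A-Za-z]+", "A", sig)
    return re.sub(r"\s+", " ", sig)

def abstract_patterns(report_lines):
    """Count how often each line pattern occurs, skipping blank lines."""
    return Counter(line_signature(l) for l in report_lines if l.strip())

report = [
    "Invoice 1001   2004-02-11",
    "Widget      12   3.50",
    "Gasket       4  10.25",
    "Invoice 1002   2004-02-12",
]
patterns = abstract_patterns(report)
for signature, count in patterns.items():
    print(count, signature)
```

Here the two invoice header lines share one pattern and the two item lines share another, so each pattern could back one virtual table.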

USE - End user access for query of business information reports. 
ADVANTAGE - Provides an improved report mining information 
retrieval system, which permits the data to be accessible for query 
by ordinary end users, as if the data were in a database, while 
retaining the inherent logic of the report design and the look and feel 
of the image format of the report. 

DESCRIPTION OF DRAWING(S) - Report mining information retrieval
system block diagram.

pp; 21 DwgNo 3C/13
Title Terms: INFORMATION; RETRIEVAL; SYSTEM; REPORT; MINE; GENERATE;
VIRTUAL; TABLE; DATABASE; DEFINE; LINE; PATTERN; TYPE; ESTABLISH; LINK;
FACILITATE; EXTRACT
Derwent Class: T01

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



33/5/14 (Item 12 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

012866489 **Image available** 
WPI Acc No: 2000-038322/200003 
XRPX Acc No: N00-028924

Integrated computer aided software engineering repository meta model
system for software process methodology
Patent Assignee: KOREA ELECTRONICS & TELECOM RES INST (KOEL-N); KOREA
ELECTRONICS & TELECOM RES (KOEL-N); ELECTRONICS & TELECOM RES INST
(ELTE-N)

Inventor: LEE H G; SHIN G S; LEE H K
Number of Countries: 002 Number of Patents: 003
Patent Family:

Patent No    Kind Date     Applicat No  Kind Date     Week
US 5995969   A    19991130 US 982386    A    19980102 200003
KR 99032818  A    19990515 KR 9753970   A    19971021 200030
KR 269258    B1   20001016 KR 9753970   A    19971021 200138

Priority Applications (No Type Date) : KR 9753970 A 19971021
Patent Details:

Patent No    Kind Lan Pg Main IPC     Filing Notes
US 5995969   A        15 G06F-017/30
KR 99032818  A           G06F-017/40
KR 269258    B1          G06F-017/40

Abstract (Basic) : US 5995969 A 

NOVELTY - Computer aided software engineering (CASE) repository 
(407) stores information of integrated CASE meta model using entity 
relationship model. The CASE repository is connected to meta entity 
type and relationship type instance generator (403) , entity type and 
relationship type table generator (404), meta entity type and 
relationship type searcher (405) and entity type and relationship 
type searcher (406) . 

DETAILED DESCRIPTION - An integrated CASE meta model (402)
connected to user interface, models as entities and directional
relationships according to type and meaning of information of
components of integrated CASE tool for processor, using entity
relationship model of conceptual modeling technique. From the
integrated CASE meta model, meta entity type and meta relationship
type instances, and entity type and relationship type tables are
generated by respective generators (403,404) connected to user
interface. Based on the generated instances and tables, meta entity
type and meta relationship type and entity type and relationship type
are searched by respective searchers (405,406) connected to user
interface. An INDEPENDENT CLAIM is also included for integrated
supporting method.

USE - For process methodology used for developing high quality
software.

ADVANTAGE - The repository automatically and integratedly manages 
information of analysis and design for structured techniques supporting 
the software development process, to provide standardization and full 
automation of software development process and sharing of information 
between the CASE tools. 

DESCRIPTION OF DRAWING (S) - The figure shows architecture of 
abstracting system of integrated CASE repository. 
Generators (403,404) 
Searcher (405,406) 

CASE repository (407) 
Abstract (Basic) : US 5963940

NOVELTY - A natural language query specifying information to be
retrieved is received and discourse aspects such as temporal or
intention information are determined based on detecting evidence sources
from query.

DETAILED DESCRIPTION - A score for each determined discourse aspect
is generated by statistical evidence combination method using
regression formula, and when the score exceeds a threshold score, an
alternate representation of query is generated. The documents in
the database are processed to tag for discourse aspects. The alternate
representation is matched against the database by determining a
measure of relevance for each document and a set of documents
satisfying a retrieval criterion is provided.

USE - For natural language processing.

ADVANTAGE - The user is able to interact with the system and refine
system's interpretation. Higher order abstractions existing in human
communication above word level are detected. The system automatically
sorts, ranks and displays documents relative to query of any length
and complexity.

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
document processor. 
Abstract (Basic) : US 5960383 A

NOVELTY - The document is divided into sections which are compared
with noun phrases. For each match, a count is incremented to obtain a
final score from which the document sections are ranked, stored in
memory and top sections are identified which are copied from document
in original order to produce a synopsis.

DETAILED DESCRIPTION - Noun phrases are automatically extracted
from document, weightages are assigned and stored in memory. A natural
language processor, CLARIT, is used to identify and rank the noun
phrases. Ranking determines the number of times a noun phrase appears
within the document as well as the frequency with which the noun phrase
is typically used in English. The document is divided into sections
such as lines, sentences, paragraphs or chapters. An INDEPENDENT CLAIM
is also included for the apparatus for automatically condensing a
document.

USE - In text management for providing a synopsis of material 
contained in a document to a reader. 

ADVANTAGE - The subject matter of a document as a whole is 
determined more precisely. 

DESCRIPTION OF DRAWING (S) - The figure shows the flow chart of the 
document condensation processes. 
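
The condensation steps in this record (score sections by noun-phrase matches, rank them, copy the top ones in original order) can be sketched as below. Sentences are used as the sections, the phrase list and weights are supplied by hand rather than extracted by a natural language processor, and the function name `condense` is hypothetical.

```python
import re

def condense(document, phrase_weights, keep=2):
    """Score each sentence by weighted noun-phrase matches; return the top `keep` in original order."""
    sections = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    scores = []
    for pos, section in enumerate(sections):
        lowered = section.lower()
        # Each occurrence of a phrase adds that phrase's weight to the section score.
        score = sum(w * lowered.count(p) for p, w in phrase_weights.items())
        scores.append((score, pos))
    top = sorted(scores, key=lambda t: (-t[0], t[1]))[:keep]
    return [sections[pos] for _, pos in sorted(top, key=lambda t: t[1])]

text = ("The keyword index maps terms to documents. The weather was pleasant. "
        "A search engine consults the keyword index for every query. Lunch was served at noon.")
weights = {"keyword index": 2.0, "search engine": 1.5}
synopsis = condense(text, weights)
print(synopsis)
```

Sorting the selected sections back by position is what keeps the synopsis in the document's original order.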
Abstract (Basic) : JP 11167568 A

NOVELTY - Sentence is divided into words in a word processing unit
(2). The character variety currently used for the word is distinguished
by a discrimination device (4) based on which extract of words is
performed by a black speck extract unit (3).

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following: recording medium storing program; paragraph processing;
document processing method.

USE - For processing document for automatic abstract
production or searching of specific word from predefined sentence.

ADVANTAGE - Recognizes a word correctly by dividing the sentence
into words, efficiently.

DESCRIPTION OF DRAWING(S) - The figure depicts the block diagram of
document processing apparatus.
(2) Word processing unit; (3) Black speck extract unit; (4) Character
variety discrimination device.
Abstract (Basic) : DE 19751955 A1

NOVELTY - The robot is formed by a number of software integrated 
(IC) circuits. There is an IC controller to control the abstract 
machine and an IC model to describe the abstract world of the real 
machine to be controlled. 

DETAILED DESCRIPTION - An IC translator generates the language of 
the abstract machine. An IC robot implements actions of an abstract 
robot which depend on events of the IC controller. An IC robot device 
implements actions of a real robot, which depend on events of the IC 
controller. An IC switch outputs the actual internal state of the real 
machine to a display unit. 

USE - For laboratory experiments or automation of industrial
processes.

ADVANTAGE - Can be matched easily to the most different of
robots.
networking environment like LAN, WAN, internet

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC)
Inventor: BROWN E W; PRAGER J M
Number of Countries: 001 Number of Patents: 001
Patent Family:

Patent No   Kind Date     Applicat No  Kind Date     Week
US 5913208  A    19990615 US 96677059  A    19960709 199933 B

Priority Applications (No Type Date) : US 96677059 A 19960709

Patent Details:

Patent No   Kind Lan Pg Main IPC     Filing Notes
US 5913208  A        18 G06F-017/30

Abstract (Basic) : US 5913208 A

NOVELTY - The hit list record pair is generated according to the
documents and selected intrinsic attributes. The intrinsic attributes
expressed as function of document length etc., of hit list record pair
are compared to detect duplication of documents.

DETAILED DESCRIPTION - The attribute fields containing one or more
attributes of the documents are included in each hit list record. The
intrinsic attribute, that is, a score expressed as function of document
length, title, concept, author, publication date and abstract, is
selected. The intrinsic attributes are invariant with location and
replication of document. The function that relates intrinsic attributes
is probabilistic measurement or cosine similarity measurement. An
INDEPENDENT CLAIM is also included for computer system.

USE - In computer networking environment like LAN, WAN, internet.

ADVANTAGE - Enables identification of duplicate documents from
search results without comparing content within documents in network.
Reduces detection cost by avoiding need for access and analysis of
document content. Facilitates identification of document like book,
article, report that contains text, image or multimedia information.

DESCRIPTION OF DRAWING(S) - The figure shows block diagram of
computer system.
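
A minimal sketch of the idea in this record: compare pairs of hit list records on attributes that do not depend on where a copy is stored, without opening the documents themselves. The record fields, the token encoding in `attribute_tokens`, and the 0.9 threshold are all assumptions for illustration; the abstract only says the relating function may be a cosine similarity or a probabilistic measure.

```python
import math
from collections import Counter
from itertools import combinations

def attribute_tokens(record):
    """Encode location-independent attributes of a hit list record as a token bag (assumed encoding)."""
    tokens = record["title"].lower().split()
    tokens += [f"author:{record['author'].lower()}",
               f"date:{record['date']}",
               f"length:{record['length']}"]
    return Counter(tokens)

def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def find_duplicates(hit_list, threshold=0.9):
    """Return URL pairs whose intrinsic attributes are nearly identical."""
    return [(x["url"], y["url"])
            for x, y in combinations(hit_list, 2)
            if cosine(attribute_tokens(x), attribute_tokens(y)) >= threshold]

hit_list = [
    {"url": "http://a.example/p1", "title": "Keyword index updating",
     "author": "Cole", "date": "1999-06-15", "length": 1800},
    {"url": "http://mirror.example/p1", "title": "Keyword index updating",
     "author": "Cole", "date": "1999-06-15", "length": 1800},
    {"url": "http://a.example/p2", "title": "Report mining system",
     "author": "Smith", "date": "1998-01-02", "length": 950},
]
duplicates = find_duplicates(hit_list)
print(duplicates)
```

The URL is deliberately left out of the comparison, since the record stresses attributes that are invariant with location and replication.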
Subject field code vector representation generation method for foreign
language processing system
Patent Assignee: UNIV SYRACUSE (UYSY-N)
Inventor: LIDDY E D; PAIK W; YU E S
Number of Countries: 001 Number of Patents: 001
Patent Family:

Patent No   Kind Date     Applicat No  Kind Date     Week
US 5873056  A    19990216 US 93135815  A    19931012 199914 B

Priority Applications (No Type Date): US 93135815 A 19931012
Patent Details:

Patent No   Kind Lan Pg Main IPC     Filing Notes
US 5873056  A        21 G06F-017/30

Abstract (Basic) : US 5873056

NOVELTY - A specific subject code is selected from identical codes
within each sentence, that occur uniquely and at equal to or more than
a certain frequency. Codes for each word are correlated with selected
unique codes. Codes with highest correlation are selected and usage
frequency of the word represented by it is determined. The codes are
arranged into a weighted vector representing the document content.

DETAILED DESCRIPTION - Subject codes assigned to each word of a
document express the semantic content of the document and they
correspond to the meaning of each word. INDEPENDENT CLAIMS are included
for the following:

(a) Natural language processing system;

(b) Apparatus for generating subject field code vector
representation of the document.

USE - For foreign language processing system.

ADVANTAGE - Provides a query which shows high similarity to the
representation of the documents since the representations of the
document and the query represent the topic at an abstract, semantic
field level, thereby making document retrieval more efficient than with
conventional key word searching procedures. Assignment of subject
codes is automatic and can be carried out under computer control
without the need for human intervention. The usage of lexical database
enables the subject codes assignment to be automatic and efficient.
Text may be processed in reasonable amount of time. Enables automatic
classification of documents using subject codes having disambiguator,
which operates in heuristic and psycholinguistic manner, mimicking the
[...]ion.

DESCRIPTION OF DRAWING(S) - The drawing illustrates a flow chart
[...] for subject field vector generation and document
[...] and retrieval.
Text summary automatic computer-based generation method - sorting
text sentences by probability factor to indicate best matched sentences

Patent Assignee: SIEMENS AG (SIEI)

Inventor: BRUECKNER T 

Number of Countries: 019 Number of Patents: 004
Patent Family:

Patent No      Kind Date      Applicat No  Kind Date     Week
WO 9841930                    WO 98DE485   A    19980218 199844 B
EP 968478                     EP 98914784  A    19980218 200006
                              WO 98DE485   A    19980218
JP 2001515623  W    2001091[?] JP 98540006 A    19980218 200169
                              WO 98DE485   A    19980218
US 6401086     B1   20020604  WO 98DE485   A    19980218 200242
                              US 99381180  A    19990916



Priority Applications (No Type Date) : DE 1011284 A 19970318 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
WO 9841930 A1 G 18 G06F-017/30

Designated States (National) : JP US

Designated States (Regional): AT BE CH DE DK ES FI FR GB GR IE IT LU MC
NL PT SE

EP 968478 A1 G G06F-017/30 Based on patent WO 9841930

Designated States (Regional) : DE FR GB

JP 2001515623 W 12 G06F-017/30 Based on patent WO 9841930

US 6401086 B1 G06F-017/30 Based on patent WO 9841930

Abstract (Basic) : WO 9841930 A 

The method is for automatic generation of a summary or
synopsis from a text by computer. A probability is determined for each
sentence or clause, that the sentence or clause belongs to the summary
or synopsis: for each word in the sentence, a measure of relevance is
looked up in a dictionary that contains all the relevant words with a
given measure of relevance for each of them, and all
the measures of relevance are then accumulated to yield the probability
for the sentence belonging to the summary.

The sentences of the text are then sorted by probability factors,
and the best matched sentences are assigned, in correspondence to a
given reduction measure, to the summary, in a sequence given by the
text.

USE - For selecting relevant and significant information from a
mass of data e.g. in form of articles, and particularly in summarising
various texts.

ADVANTAGE - Provides summary or synopsis of text which briefly 
gives most important points/content of text. 
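
The scoring and selection described in this abstract can be sketched as follows, under stated assumptions: the accumulated relevance is used directly as the sentence score (the abstract does not give the formula that turns it into a probability), the reduction measure is taken to be the fraction of sentences to keep, and `summarize` and the sample relevance dictionary are hypothetical.

```python
import re

def summarize(text, relevance, reduction=0.5):
    """Keep the best-scoring fraction `reduction` of sentences, in text order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    scored = []
    for pos, sentence in enumerate(sentences):
        words = re.findall(r"[a-z]+", sentence.lower())
        # Accumulate the dictionary's relevance measure over the words of the sentence.
        scored.append((sum(relevance.get(w, 0.0) for w in words), pos))
    keep = max(1, round(len(sentences) * reduction))
    best = sorted(scored, key=lambda t: (-t[0], t[1]))[:keep]
    return " ".join(sentences[pos] for _, pos in sorted(best, key=lambda t: t[1]))

relevance = {"patents": 0.8, "patent": 0.8, "inventions": 0.6, "abstract": 0.7, "claims": 0.5}
text = ("Patents protect inventions. It rained on Tuesday. "
        "An abstract summarizes the patent claims. Coffee was cold.")
summary = summarize(text, relevance)
print(summary)
```

Lowering `reduction` shortens the summary; the selected sentences always appear in the order given by the text.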
Identification method for documents stored in computer readable medium -
in which computer user browses documents by prompts to construct query
expressions from autogenerated list of keyword phrases

Patent Assignee: RUBINSTEIN S I (RUBI-I) 

Inventor: RUBINSTEIN S I 

Number of Countries: 080 Number of Patents: 005 



Patent No 
Priority Applications (No Type Date) : US 96687656 A 19960726; US 96628098 A
Patent Details:

Patent No Kind Lan Pg Main IPC Filing Notes 

WO 9804982 A1 E 26 G06F-017/30

Designated States (National) : AL AM AT AZ BA BB BG BR BY CA CH CN CU CZ
DE DK EE ES FI GB GE GH HU IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV
MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG
Designated States (Regional): AT BE CH DE DK EA ES FI FR GB GH GR IE IT
KE LS LU MC MW NL OA PT SD SE SZ UG ZW
EP 979466 A1 E G06F-017/30 Based on patent WO 9804982

Designated States (Regional): AT BE CH DE DK ES FI FR GB GR IE IT LI LU 
MC NL PT SE 

US 5721897 A 15 G06F-017/30 CIP of application US 96628098 

AU 9736611 A G06F-017/30 Based on patent WO 9804982 

TW 347516 A G06F-009/06 

Abstract (Basic) : WO 9804982 A 

The method for browsing documents involves automatically 
identifying for a user, keyword phrases in the documents (110), 
prompting the user to construct a query expression in which at least 
one of the keyword phrases is an operand (120) and identifying one of 
the documents based on the query expression. 

The identified document is presented to the user in the form of an 
abstract (140,150), and identification of the keyword phrases and 
generation of the abstract are accomplished by linguistically 
analysing the documents (140) . A user interface featuring a tabbed 
index into either keywords or key phrases in separately scrollable 
display areas provides indexing into the content of each screen area. 

USE - Browsing by prompted keywords with user interface, to 
identify documents by prompted keyword phrases. 
Computer automatic index technology of Chinese-character document

Patent Assignee: WANG Z (WANG-I) 
Inventor: BAO A; WANG C; WANG Z 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

CN 1110814 A 19951025 CN 94103485 A 19940418 199736 B 

Priority Applications (No Type Date) : CN 94103485 A 19940418 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
CN 1110814 A 1 

Abstract (Basic) : CN 1110814 A 

The new automatic indexing technique for Chinese-character
documents is a break-through in Chinese information processing, which
features no need of creating words or phrase library and no
limitation to newly created words or phrases, so matching with the
high-speed development of modern science and technology.

ADVANTAGE - Provides high indexing speed (indexing 1500 abstracts
in 5 min.) and accuracy (near 100%).
Dwg. 1

Title Terms: COMPUTER; AUTOMATIC; INDEX; TECHNOLOGY; CHINESE; CHARACTER;
DOCUMENT
Derwent Class: T01

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



33/5/24 (Item 22 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 
(c) 2004 Thomson Derwent. All rts. reserv.

010631925 **Image available**
WPI Acc No: 1996-128878/199613
Related WPI Acc No: 1999-[?]93951
XRPX Acc No: N96-108434

Iterative and interactive formal specification generation method
involves creating formal specification for system using specification
language and iteratively repeating steps until implementation has desired
behaviour and is realisable

Patent Assignee: LSI LOGIC CORP (LSIL-N)
Inventor: DANGELO C; NAGASAMY V
Number of Countries: 001 Number of Patents:
Patent Family:
Patent No Kind Date Applicat No Kind Date Week
Priority Applications (No Type Date): US 94252231 A 19940601
Patent Details:

Patent No   Kind Lan Pg Main IPC     Filing Notes
US 5493508  A        31 G06F-017/50



Abstract (Basic) : US 5493508 A

The method involves establishing a desired behaviour for a complex
digital system. A formal specification for the complex digital system
is created using a formal, directly-executable specification language.

The following steps are iteratively performed. The syntactical
correctness of the formal specification is checked. The logical
consistency of the formal specification is checked. The completeness of
the formal specification is checked. The correctness of the formal
specification is checked. The desired behaviour is compared with the
behaviour described by the formal specification. The existence of at
least one realizable implementation of the formal specification is
determined. Any problems detected are corrected by revising the formal
specification. This is done until the formal specification is
syntactically correct, logically consistent, correct and complete,
reflects the desired behaviour, and has at least one realizable
implementation.

ADVANTAGE - Allows work at high abstraction levels with large and
complex systems. Automatically translates behavioural circuit
descriptions into implementations. Raises level of design validation
from structural (net-list) to behavioural level. Provides standardised
environment reducing need for cross training between platforms and
directing resources to testability. Provides automated, interactive and
iterative technique for complex specification. Allows execution of
specification in automated fashion to give implementation. Coordinates
system and subsystem specifications with design implementations.
Ensures specification changes made are reflected and accounted for in
system wide fashion. Converges on feasible implementation. Allows user
to query properties of formal executable specification. Assists user
in understanding system properties and subsystem relationships. Allows
user to assess completeness of specification. Allows determination of
correctness of specification w.r.t. intended system properties.
Automatic document abstractor - using hint words in search of
document to extract relevant sentences and allowing for their
juxtaposition
Patent Assignee: TOSHIBA KK (TOKE)
Inventor: DOI M
Priority Applications (No Type Date) : JP 88245967 A 19880930
Cited Patents: No-SR.Pub; 2.Jnl.Ref; EP 32194; JP 61248160 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
EP 361464 A E 21 

Designated States (Regional) : DE FR GB 
US 5077668 A 19 
EP 361464 B1 E G06F-017/24

Designated States (Regional) : DE FR GB 
DE 68928775 E G06F-017/24 Based on patent EP 361464 

Abstract (Basic) : EP 361464 A 

The apparatus allows 'hint' words to be selected from a dictionary (3)
which will relate to significant phrases in a document held in a memory
(2). The document is then searched for the sentences containing the
hint words and the sentences are abstracted (4) and juxtaposed using
the input (1), output (5) and control (6) sections to produce a
suitable abstract.

Where the number of hint words produces a lengthy document, a
further processing step can be applied. This provides a logical
structure storage (7) and analysis (8), which performs morphological
analysis of the abstracted sentences, and a modification unit (9),
which together allow titles etc. to be eliminated and a more relevant
selection of sentences to be abstracted.

USE/ADVANTAGE - Efficient abstracting of documents held on 
document retrieval system. 
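
The first stage described in this record (search the document for sentences containing hint words and juxtapose them) can be sketched as below. The hint word list is invented for the example, prefix matching stands in for proper morphological handling, and `extract_abstract` is a hypothetical name; the logical-structure analysis of the second stage is not modelled.

```python
import re

HINT_WORDS = {"propose", "conclude", "result", "purpose"}  # hypothetical dictionary entries

def extract_abstract(document, hint_words=HINT_WORDS):
    """Juxtapose, in original order, the sentences that contain a hint word."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    picked = []
    for sentence in sentences:
        words = set(re.findall(r"[a-z]+", sentence.lower()))
        # Prefix match so that "results" or "proposed" also count as hits.
        if any(w.startswith(h) for w in words for h in hint_words):
            picked.append(sentence)
    return " ".join(picked)

document = ("We propose a new indexing scheme. The office moved last year. "
            "The results show faster retrieval.")
abstract = extract_abstract(document)
print(abstract)
```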
Abstract (Basic) : CA 2446831 A1

NOVELTY - The system includes a systems interface that defines
logical import and export interfaces, data transformation rule sets
and scripts. A metadata database stores the interfaces, sets and
scripts. A script processor (37) uses metadata (38) from the database
to control data transformation within the systems interface and
movement of the data into and out of the distribution system. A rule
set processor (36) responds to the script processor to manipulate a
data bag (35) to store imported data and a data bag for storing
exported data.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also given for

(1) a method of controlling data transformation within a
distribution system; and

(2) a computer readable memory for transforming and exchanging
datastore data between heterogeneous computer systems.

USE - For distributing data between heterogeneous computer systems.

ADVANTAGE - Provides a simple mechanism to specify transformation
definitions and to control the flow of data from an input source to an
output target. Allows dynamic response to changing business
environment. Non-intrusive to minimize effects of changing data formats
or data storage types.

DESCRIPTION OF DRAWING(S) - The drawing shows a block diagram of a
system for transforming and exchanging data between heterogeneous
distributed computing environments.

Data bag (35) 

Rule set processor (36) 

Script processor (37) 
Metadata (38) 
Continuation document identification method in website, involves
identifying lateral links to continuation documents of primary document
by identifying whether continuation document terms are associated with
links

Patent Assignee: PALLMANN D (PALL-I) 
Inventor: PALLMANN D 
US 20020078014 A1 35 G06F-007/00

Abstract (Basic) : US 20020078014 A1

NOVELTY - The primary document identified by the document address
is retrieved using computer executable logic. The links to the other
documents provided in the primary document are identified. The computer
system identifies which of the identified links are lateral links to
continuation documents of the primary document, by identifying whether
any continuation document terms are associated with the links.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the
following:

(1) Continuation document identification system;

(2) Document crawling method; and

(3) Document crawling system.

USE - For identifying continuation documents in website.

ADVANTAGE - Continuation documents within a domain can be
effectively identified and previously processed links can be easily
determined.

DESCRIPTION OF DRAWING(S) - The figure shows the hierarchical
structure of the documents in the website.
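
A small sketch of the link test described in this record: collect the links in a primary document and keep those whose anchor text contains a continuation term. The term list is an assumption (the abstract does not enumerate the terms), only anchor text is inspected, and `LinkCollector` and `continuation_links` are hypothetical names built on Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

CONTINUATION_TERMS = ("next", "continued", "more", "page 2")  # hypothetical term list

class LinkCollector(HTMLParser):
    """Collect (href, anchor text) pairs from an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None

def continuation_links(html):
    """Return hrefs of links whose anchor text contains a continuation term."""
    parser = LinkCollector()
    parser.feed(html)
    return [href for href, text in parser.links
            if any(term in text.lower() for term in CONTINUATION_TERMS)]

page = '<p>Story text <a href="/story?page=2">Next page</a> <a href="/about">About us</a></p>'
lateral = continuation_links(page)
print(lateral)
```

A crawler could follow the returned links as part of the same logical document instead of treating them as separate pages.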
T?) OR SPIDER? OR SPYDER? OR INTELLIGENT()AGENT? OR SOFTBOT?
             OR IA OR BOT OR BOTS
S6        0  CRAWLING(N)(DOCUMENT? OR FILE? OR TEXT? ? OR RECORD? ? OR
             REPORT? ? OR BRIEF? ? OR INFORMATION)
S7      660  METADATA OR META()DATA
S8      171  (LINK? OR CONNECT? OR INTERFACE? OR JOIN? OR UNITE?)()(DATA
             OR INFORMATION)
S9     5717  CACHE? OR REGISTER OR MEMORY OR STORAGE OR BUFFER? OR
             REPOSITORY
S10    7162  SAVE? ? OR SAVING OR STORE OR STORING OR KEEP OR KEEPING OR
             PRESERV?
S11    9461  ACQUIRE? OR GET OR RETRIEVE? OR OBTAIN? OR PROCURE? OR CALL
             OR FETCH?
S12   38789  ENGINE? OR MODULE? ? OR ENGINE? OR COMPONENT? ? OR ELEMENT?
             ? OR ROUTINE? OR APPLICATION? OR PROGRAM? OR DATABASE? OR
             DATA()BASE?
S13    1518  INDEXING OR INDEX OR INDEXES OR INDICES
S14       1  S1 AND S2 AND (S3 (2N) S4)
S15       0  S5 AND S6
S16       0  S4 AND S11 AND (S7 (2N) S8)
S17      29  S5 AND S11 AND (LINK? OR CONNECT? OR INTERFACE? OR JOIN? OR
             UNITE?)
S18       0  S17 AND S7
S19       0  S9 AND S8 AND S10 AND S5
S20       1  (S4 (2N) S12) AND S2 AND S4 AND S7
S21       0  (S12 (2N) S13) AND S13 AND S7 AND S8
S22       0  S12 AND S13 AND S7 AND S8
S23       3  S3 AND S7 AND S8
S24    2314  (SEARCH? OR QUEST? OR PURSU? OR SEEK? OR QUER? OR MATCH?)
             (2N) S12
S25       0  S24 AND S14
S26      33  S14 OR S17 OR S23
S27      10  S26 NOT PY>2000
S28       6  S27 NOT PD>20000211
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01693103 DOCUMENT TYPE: Product 

PRODUCT NAME: WebZIP (693103) 

Spidersoft (642487) 
16 Argyle St 
Donvale VIC 3111, Australia 
TELEPHONE: ( ) 613-98415251 

RECORD TYPE: Directory 

CONTACT: Sales Department 

With Spidersoft's WebZIP, Web users can download Web pages and entire Web 
sites for offline browsing. WebZIP captures images, sounds, and media files 
to users' hard drives. The system streamlines browsing, and it lets surfers 
store online content quickly. Files are saved in native HTML format, 
letting users employ captured links. Original filenames and directory 
structures also are saved. WebZIP can be extended with the FAR add-in tool, 
which lets users convert downloaded Web content into compiled HTML-Help 
files. The compressed files support full text searches, table of contents 
panes, as well as index and favorites panes. WebZIP's Task Editor lets 
users schedule content downloads and specify the types of files to be 
retrieved. Users can also define the comprehensiveness of downloads. The 
system works with Microsoft Internet Explorer 4 or later. 

DESCRIPTORS: File Compression; File Transfer; Information Retrieval; 
Internet Utilities 

HARDWARE: 80486; IBM PC & Compatibles; Pentium 

OPERATING SYSTEM: Internet Explorer; Windows; Windows NT/2000; Windows XP 
PROGRAM LANGUAGES: Not Available 
TYPE OF PRODUCT: Micro 

POTENTIAL USERS: Cross Industry, Frequent Web Users, Researchers, Offline 
Browsing 
PRICE: Available upon request 

OTHER REQUIREMENTS: 16MB RAM; Win 9x+; Explorer 4+; 80486+ CPU required 
REVISION DATE: 20020715 
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01002151 DOCUMENT TYPE: Product 

PRODUCT NAME: DQbroker (002151) 

Decision Support Inc (674168) 
624 Matthews-Mint Hill Rd #150 
Matthews, NC 28105 United States 
TELEPHONE: (704) 845-1000 

RECORD TYPE: Directory 

CONTACT: Sales Department 

DQbroker (R) is the foundation of Decision Support's suite of products, 
which are enterprise data integration and extraction, transformation, and 
loading (ETL) and Web-based reporting tools. DQbroker lets users find 
needed data independent of database vendor, hardware platform, or location. 
This gives users and developers simultaneous access to all available data 
in real time. It enables movement of notably less data across the 
network when processing queries that join distributed data. DQbroker is 
significantly more efficient and effective at accessing and joining 
enterprise data than three-tiered, ODBC-based, and hub design solutions. 
The basic installation and configuration of DQbroker can be accomplished in 
less than an hour by any system administrator. It is easy. DQbroker has a 
thin-client architecture that distributes the processing of queries as 
close to the source data as possible. Queries that access and join data 
from multiple heterogeneous data sources are processed on multiple servers 
simultaneously. Only the data necessary to resolve the query crosses the 
network. This differs from the fat-server, three-tier approach common in 
the marketplace today, which requires entire datasets to move across the 
network to be processed by a single hub server. DQbroker converts a single 
SQL query that joins distributed data into multiple SQL queries. Each 
subquery is targeted at a single data source. The subqueries contain as 
much selection, filtering, joining, and sorting as possible. This allows 
DQbroker to leverage the capabilities of each database management system 
(DBMS), minimizing the amount of data returned and the time needed to 
return it. The processing happens as efficiently as possible because 
DQbroker communicates with all major relational database management systems 
using native access routines. A global metadata cache enables every 
DQbroker server to know the current state of all data in a distributed 
domain. Caching metadata also makes the retrieval of database properties 
faster. This accelerates query processing in a distributed environment 
because DQbroker has all the information it needs to distribute subqueries. 
DQvista works with DQbroker to provide enterprisewide reporting that 
joins data sources. 
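
The query decomposition described above can be sketched in Python. This is only an illustration under stated assumptions, not DQbroker's implementation: two in-memory SQLite databases stand in for separate data sources, and the table names, columns, and the `distributed_join` helper are all hypothetical. Each source receives one subquery with its own filter pushed down, so only matching rows reach the broker, which then performs the join.

```python
import sqlite3


def make_source(ddl, insert_sql, rows):
    """Create an in-memory database standing in for one remote data source."""
    conn = sqlite3.connect(":memory:")
    conn.execute(ddl)
    conn.executemany(insert_sql, rows)
    return conn


# Hypothetical sources: customers live in one DBMS, orders in another.
customers_db = make_source(
    "CREATE TABLE customers (id INTEGER, name TEXT, region TEXT)",
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, "Acme", "EU"), (2, "Globex", "US"), (3, "Initech", "EU")],
)
orders_db = make_source(
    "CREATE TABLE orders (customer_id INTEGER, total REAL)",
    "INSERT INTO orders VALUES (?, ?)",
    [(1, 50.0), (1, 500.0), (2, 900.0), (3, 700.0)],
)


def distributed_join(region, min_total):
    """Answer 'orders >= min_total for customers in region' across both sources."""
    # Subquery 1: the region filter runs inside the customers source.
    customers = customers_db.execute(
        "SELECT id, name FROM customers WHERE region = ? ORDER BY id",
        (region,),
    ).fetchall()
    # Subquery 2: the amount filter and sort run inside the orders source.
    orders = orders_db.execute(
        "SELECT customer_id, total FROM orders WHERE total >= ? "
        "ORDER BY customer_id, total",
        (min_total,),
    ).fetchall()
    # Only the filtered rows crossed the "network"; the broker joins them.
    names = dict(customers)
    return [(names[cid], total) for cid, total in orders if cid in names]
```

Calling `distributed_join("EU", 100.0)` moves two customer rows and three order rows to the broker instead of both full tables, which is the saving the record attributes to pushing selection and filtering into each subquery.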

DESCRIPTORS: Database Management; Distributed Processing; Enterprise 
Application Integration; Integration Software; Metadata; Middleware; 
System Performance 

HARDWARE: Alpha; HP; IBM; IBM PC & Compatibles; NCR; Sun; Thin Clients; 
Unisys; UNIX 

OPERATING SYSTEM: AIX; HP-UX; Linux; Solaris; UNIX; Windows NT/2000 
PROGRAM LANGUAGES: ActiveX; C++; Java; SQL 
TYPE OF PRODUCT: Mini; Micro; Workstation 
POTENTIAL USERS: Cross Industry, IT Managers 
PRICE: Available upon request 

DOCUMENTATION AVAILABLE: User manuals 

TRAINING AVAILABLE: Technical support; training; telephone support; 
on-site training; Internet support 
OTHER REQUIREMENTS: TCP/IP connection required 
SERVICES AVAILABLE: Consulting 
REVISION DATE: 20040215 
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00122282 DOCUMENT TYPE: Review 

PRODUCT NAMES: Ultraseek Server CCE 3.1 (742627) 

TITLE: Any Document, Any Place, and a Place for Every Document 

AUTHOR: Westmacott, Ian 

SOURCE: Server/Workstation Expert, v10 n12 p56(3) Dec 1999 
HOMEPAGE: http://www.cpg.com 

RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE: A 

GO.com's Ultraseek Server, a document search engine and indexer, is 
available with the Ultraseek Server CCE 3.1 add-on, an excellent choice for 
users who need directory-s^^b search abilities. Ultraseek Server works 
very well as a search engine for distributed text-based documents. 
Ultraseek Server provides English indexes and partial support for other 
languages, a sophisticated query interface, an intelligent scanner, and a 
customizable scripted results interface. Platforms supported include 
Solaris, Linux, and Windows NT, and symmetric multiprocessing systems are 
also supported. The server can index documents by spidering a network 
(beginning with a specific URL, then tracing links found in follow-on 
documents); scanning a local file system; scanning Usenet newsgroups; 
scanning Web-accessible Microsoft Exchange public folders; or by mirroring 
other Ultraseek Server indexes. Database content indexing is not supported. 
Document types supported include HTML, XML, RTF, Word, Excel, PowerPoint, 
FrameMaker, Portable Document Format (PDF), PostScript, WordPerfect, Lotus 
1-2-3, WordPro, and Freelance Graphics. The scanner runs continually to 
find and retrieve documents to be indexed; automatically removes deleted 
documents from the index; and updates altered documents. Many other useful 
features are described. 
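
The spidering mode mentioned in this review (begin at a specific URL, then trace the links found in follow-on documents) is, in outline, a breadth-first traversal. The Python sketch below is a generic illustration and not Ultraseek's code; `spider`, `fetch_links`, and the toy `SITE` link graph are hypothetical names, and a real spider would fetch pages over HTTP rather than read a dictionary.

```python
from collections import deque


def spider(start_url, fetch_links, max_pages=100):
    """Visit start_url, then the links found on each visited page, breadth-first.

    fetch_links(url) must return the list of URLs linked from that page.
    Returns the URLs in the order they were visited.
    """
    seen = {start_url}          # every URL ever queued, to avoid revisits
    queue = deque([start_url])
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        visited.append(url)     # a real indexer would index the page here
        for link in fetch_links(url):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited


# Toy link graph standing in for a small intranet site.
SITE = {
    "/index": ["/a", "/b"],
    "/a": ["/b", "/c"],
    "/b": ["/index"],
    "/c": [],
}
```

The `seen` set is what keeps the loop from cycling when pages link back to each other, as `/b` does to `/index` in the toy graph.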

PRICE: $995 

COMPANY NAME: Overture Services Inc (643122) 
SPECIAL FEATURE: Screen Layouts Charts 

DESCRIPTORS: IBM PC & Compatibles; Indexing; Information Retrieval; 
Intranets; Linux; Search Engines; Solaris; Text Retrieval; Windows 
NT/2000 
REVISION DATE: 20040627 
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00120309 DOCUMENT TYPE: Review 

PRODUCT NAMES: UpShot Online (779288); Actuate Reporting System (613941) 
; Mainspan (750212) 

TITLE: Business Tools Get 'Webified' 

AUTHOR: Edwards, John 

SOURCE: CIO, v12 n20 p40(3) Aug 1, 1999 

ISSN: 0894-9301 

HOMEPAGE: http://www.cio.com 

RECORD TYPE: Review 

REVIEW TYPE: Product Analysis 

GRADE: Product Analysis, No Rating 

UpShot Sales from UpShot Corporation, Actuate Reporting System from Actuate 
Software Corporation, and Mainspan from Webridge are Web-based 
administrative tools that bring new capabilities to sales, reporting, and 
corporate communication. UpShot Sales has a browser-based interface that 
lets team members in any location have universal access, allowing strategic 
information to be disseminated throughout an organization. Its e-mail and 
chat capabilities support sales force collaboration. Actuate Reporting 
System has been adapted to let organizations distribute reports to 
managers, employees, and external partners via the Web, where an Actuate 
Web Agent tool transforms the program's output into a Web site with 
individual URLs for specific reports and folders. Actuate will also 
automatically push notices about new reports directly to targeted 
individuals. Mainspan spans the knowledge gap that sometimes arises between 
product manufacturers and key customers by letting vendors use the World 
Wide Web as a distribution medium for a wide array of confidential 
information, which will also accelerate the information distribution 
process. 



COMPANY NAME: Siebel Systems Inc (608246); Actuate Corp (618764); 
Webridge Inc (662704) 



DESCRIPTORS: Internet Utilities; Intranets; Marketing Information; Report 
Generators; Sales Analysis; Sales Force Automation 
REVISION DATE: 20031222 
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00116827 DOCUMENT TYPE: Review 

PRODUCT NAMES: ISYS: Spider 1.5 (752088) 

TITLE: Spider spins power searches 

AUTHOR: Rapoza, Jim 

SOURCE: PC Week, v16 n10 p32(1) May 10, 1999 
ISSN: 0740-1604 

RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE: B 

ISYS/Odyssey Development's ISYS: Spider 1.5 adds Web site searching to the 
vendor's suite of corporate search tools. ISYS: Spider is an easy-to-use 
agent that searches and indexes external World Wide Web sites, and when 
used with Isys: Desktop and Isys:Web, it allows corporations to construct 
and maintain indexes of documents stored on networks, company Web sites, 
and particular external World Wide Web sites. ISYS: Spider gets good marks 
for usability, performance, and manageability, while capability and 
interpretability are rated average. Spider's only features are the 
ability to search sites and to index their contents, and it lacks some 
features provided in competing search products, such as Plumtree Server and 
others, which can scan many data repositories, including internal and 
external databases. Such products as Netscape Communications' Compass 
Server provide personalized searches that send back links based on users' 
criteria. To obtain full-fledged Web support, testers installed Desktop, 
then Web, then Spider. Creating and indexing document databases is easy, 
but managing databases should be easier, since the tools do not ease tasks 
required to search indexes of multiple sites concurrently. Unless the 
manager links the indexes in advance, the Web searches them sequentially. 

PRICE: $4500 

COMPANY NAME: ISYS Search Software Inc (512991) 
SPECIAL FEATURE: Charts 

DESCRIPTORS: Front Ends; IBM PC & Compatibles; Indexing; Information 
Retrieval; Search Engines 
REVISION DATE: 20040524 
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00116139 DOCUMENT TYPE: Review 

PRODUCT NAMES: Company -- Computer Associates International Inc (850161) 

TITLE: Software Giant's Hardware ' kings 

AUTHOR: Royal, Weld 

SOURCE: Industry Week, v248 n4 p72(2) Feb 15, 1999 
ISSN: 0039-0895 

HOMEPAGE: http://www.industryweek.com 
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05603291 E.I. No: EIP00075234405 

Title: WTMS : a system for collecting and analyzing topic-specific Web 
information 

Author: Mukherjea, Sougata 

Corporate Source: NEC USA Inc, San Jose, CA, USA 

Conference Title: WWW9: 9th International World Wide Web Conference 'The 
Web: The Next Generation' 

Conference Location: Amsterdam, Neth Conference Date: 20000515-20000519 
E.I. Conference No.: 56980 

Source: Computer Networks v 33 n 1 2000. p 457-471 

Publication Year: 2000 

CODEN: 003195 ISSN: 1389-1286 

Language: English 

Document Type: JA; (Journal Article) Treatment: X; (Experimental) 
Journal Announcement: 0008W4 

Abstract: With the explosive growth of the World Wide Web, it is becoming 
increasingly difficult for users to collect and analyze Web pages that are 
relevant to a particular topic. To address this problem we are developing 
WTMS, a system for Web topic management. In this paper we explain how the 
WTMS crawler efficiently collects Web pages for a topic. We also introduce 
the user interface of the system that integrates several techniques for 
analyzing the collection. Moreover, we present the various views of the 
interface that allow navigation through the information space. We highlight 
several examples to show how the system enables the user to gain useful 
insights about the collection. (Author abstract) 20 Refs. 

Descriptors: *World Wide Web; Information retrieval; Query languages; 
Graphical user interfaces; Search engines; Web browsers; Algorithms; 
Heuristic programming 

Identifiers: Web topic management systems; Focussed crawling; 
Information visualization; Graph algorithms; Hubs; Authorities 

Classification Codes: 

723.5 (Computer Applications); 723.3 (Database Systems); 722.2 
(Computer Peripheral Equipment); 723.1 (Computer Programming) 
723 (Computer Software); 722 (Computer Hardware) 
72 (COMPUTERS & DATA PROCESSING) 
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05226329 E.I. No: EIP99020003839 

Title: Collaborative web crawling: Information gathering/processing 
over Internet 

Author: Teng, Shang-Hua; Lu, Qi; Eichstaedt, Matthias; Ford, Daniel; 
Lehman, Tobin 

Corporate Source: Univ of Illinois, Urbana, IL, USA 

Conference Title: Proceedings of the 1999 32nd Annual Hawaii 
International Conference on System Sciences, HICSS-32 

Conference Location: Maui, HI, USA Conference Date: 19990105-19990108 
Sponsor: IEEE 

E.I. Conference No.: 49733 

Source: Proceedings of the Hawaii International Conference on System 
Sciences 1999. IEEE Comp Soc, Los Alamitos, CA, USA, PR00001. p 186 
Publication Year: 1999 
CODEN: PHISD7 ISSN: 1060-3425 
Language: English 

Document Type: CA; (Conference Article) Treatment: G; (General Review) 
Journal Announcement: 9904W1 

Abstract: In this paper, we present a scalable method for collaborative 
web crawling and information processing. The method includes an automatic 
cyberspace partitioner which is designed to dynamically balance and 
re-balance the load among processors. It can be used when all web 
crawlers are located on a tightly coupled high-performance system as well 
as when they are scattered in a distributed environment. We have 
implemented our algorithms in Java as a part of the IBM Grand Central 
Station (GCS) system. (Author abstract) 
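
The abstract does not say how the cyberspace partitioner balances load, so the following Python sketch swaps in a simple, well-known stand-in: a greedy rule that hands the heaviest remaining host to the currently least-loaded crawler. The function names, the per-host load estimates, and the choice of hosts as the unit of partitioning are all assumptions for illustration (the authors report a Java implementation). Re-balancing is modelled by re-running the partition with updated load estimates.

```python
import heapq


def partition(host_loads, n_crawlers):
    """Assign each host to a crawler index, heaviest host first (greedy rule).

    host_loads maps a host name to an estimated crawl cost (an assumption here).
    """
    heap = [(0, i) for i in range(n_crawlers)]   # (current load, crawler index)
    heapq.heapify(heap)
    assignment = {}
    # Sort by decreasing load; ties broken by host name for determinism.
    for host, load in sorted(host_loads.items(), key=lambda kv: (-kv[1], kv[0])):
        total, crawler = heapq.heappop(heap)     # least-loaded crawler so far
        assignment[host] = crawler
        heapq.heappush(heap, (total + load, crawler))
    return assignment


def crawler_loads(assignment, host_loads, n_crawlers):
    """Total estimated load per crawler under a given assignment."""
    totals = [0] * n_crawlers
    for host, crawler in assignment.items():
        totals[crawler] += host_loads[host]
    return totals
```

With estimated loads of 50, 30, 20 and 10 for four hosts and two crawlers, this yields per-crawler totals of 60 and 50; if the smallest host later grows to 40, re-running `partition` gives 70 and 70.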

Descriptors: *World Wide Web; Data processing; Algorithms; Java 
programming language 

Identifiers: Collaborative Web crawling; Abstract only 

Classification Codes: 

723.1.1 (Computer Programming Languages) 

723.2 (Data Processing); 723.1 (Computer Programming) 

716 (Radar, Radio & TV Electronic Equipment); 723 (Computer Software) 
71 (ELECTRONICS & COMMUNICATIONS); 72 (COMPUTERS & DATA PROCESSING) 
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6182240 INSPEC Abstract Number: C1999-04-7210N-057 

Title: Collaborative Web crawling: information gathering/processing 
over Internet 

Author(s): Shang-Hua Teng; Qi Lu; Eichstaedt, M.; Ford, D.; Lehman, T. 
Author Affiliation: Dept. of Comput. Sci., Illinois Univ., Urbana, IL, 
USA 

Conference Title: Proceedings of the 32nd Annual Hawaii International 
Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of 
Full Papers p. 12 pp. 

Editor(s): Sprague, R.H., Jr. 

Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA 

Publication Date: 1999 Country of Publication: USA liii+341 pp. 
ISBN: 0 7695 0001 3 Material Identity Number: XX-1999-00169 

Conference Title: Proceedings of HICSS 32 - 32nd Annual Hawaii 
International Conference on System Sciences 

Conference Date: 5-8 Jan. 1999 Conference Location: Maui, HI, USA 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: The main objective of the IBM Grand Central Station (GCS) 
project is to gather all types of information in any format (text, data, 
image, graphics, audio, video) from cyberspace, to process/index/summarize 
the information, and to push the right information to the right people. 
Because of the very large scale of cyberspace, parallel processing in both 
crawling/gathering and information processing is indispensable. We present 
a scalable method for collaborative Web crawling and information 
processing. The method includes an automatic cyberspace partitioner which 
is designed to balance and re-balance the load dynamically among 
processors. It can be used when all Web crawlers are located on a tightly 
coupled high-performance system as well as when they are scattered in a 
distributed environment. We implemented these algorithms in Java. (12 
Refs) 
Subfile: C 

Descriptors: information resources; information retrieval; Internet; Java; 
resource allocation 

Identifiers: collaborative Web crawling; information gathering; 
information processing; Internet; IBM Grand Central Station project; 
cyberspace; indexing; parallel processing; load balancing; high-performance 
system; distributed environment; Java 

Class Codes: C7210N (Information networks); C7250R (Information retrieval 
techniques) 

Copyright 1999, IEE 
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01947154 SUPPLIER NUMBER: 18382915 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Document management embraces the intranet: report from Documation '96. 
(conference on technical information management) (Industry Trend or Event) 

Walter, Mark 

Seybold Report on Publishing Systems, v25, n14, p11(7) 
April 23, 1996 

ISSN: 0736-7260 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 5396 LINE COUNT: 00433 

SPECIAL FEATURES: illustration; chart 

COMPANY NAMES: OpenText Corp. --Products; Documentum Inc. --Products; Day 
and Zimmermann Information Solutions --Products; DigiTome --Products 

DESCRIPTORS: Industry Event; Trade Show Report; Document Management 
Software; Workflow Software 

SIC CODES: 7372 Prepackaged software 

TRADE NAMES: OpenText LiveLink Intranet (Workflow software) --Design and 
construction; Documentum 3.0 (Document management software) --Design and 
construction; Interactive Presentation Manager (Multimedia authoring 
software) --Design and construction; Intelligent Document Manager (File 
format conversion software) --Design and construction 

FILE SEGMENT: CD File 275 

abstract . 

If you don't already have summaries, Livelink Search will build them 
for you with its automatic document summary generator. Like most other 
Web-page summarizers, Open Text's is not a triumph of lexical synthesis; 
but... 

...example, if you had a directory of abstracted articles, you could ask 
the summarizer to show the abstracts instead of presenting an outline 
of the document.) If you are lucky enough to have valid SGML documents, 
Livelink Search... 
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01944268 SUPPLIER NUMBER: 18315432 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Systems management tools. (1996 Database Buyer's Guide and Client/Server 
Sourcebook) (Buyers Guide) 

DBMS, v9, n6, p87(2) 
June 15, 1996 

DOCUMENT TYPE: Buyers Guide ISSN: 1041-5173 LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 1624 LINE COUNT: 00148 

ABSTRACT: A buyer's guide of 13 systems management software packages is 
presented. Information presented includes a brief description of each 
product, pricing information, information on operating systems and other 
software supported, each vendor's location, and a telephone number and 
World Wide Web address, when available, for each vendor. Products 
discussed include a networked performance analysis tool that monitors 
traffic between applications in a client/server environment, a product 
that helps network managers analyze software usage and license compliance, 
a software configuration management package and a tool that automatically 
detects application transaction errors in a client/server environment. 

DESCRIPTORS: Software Buyers' Guide; Network Management Utility; Network 
Management Software; Systems Management Utility 
SIC CODES: 7372 Prepackaged software 
FILE SEGMENT: CD File 275 



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ABSTRACT: Individual Inc's HeadsUp customized electronic news service 
provides subscribers with access to over 700 news sources, including 
articles, news items, press releases and other reports. Subscribers can 
create profiles that specify their particular interests, choosing items 
from a list of over 700 categories. HeadsUp utilizes a string of 486-based 
PCs and a software system called System for Manipulation and Retrieval of 
Text (SMART). SMART locates the items that best match the users' profiles. 
HeadsUp provides subscribers with two-sentence summaries of articles, from 
which subscribers can order the full text versions via a toll-free number. 
The articles are sent within a half hour via fax or E-mail. SMART scans 
through more than 10,000 articles per day and prioritizes them for each 
subscriber. Subscriptions cost $695 per year, which includes 160 full text 
records, or $29.95 per month, which includes one full text record. 
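
The abstract says only that SMART finds the items that best match each profile and prioritizes them; it does not describe the scoring. One common way to do this kind of matching is to treat the profile and each article as term-frequency vectors and rank by cosine similarity, which is what the Python sketch below assumes. The function names and the whitespace tokenizer are illustrative choices, not details taken from HeadsUp.

```python
import math
from collections import Counter


def vectorize(text):
    """Term-frequency vector from whitespace-separated, lower-cased tokens."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def rank_articles(profile, articles):
    """Return ids of matching articles, best match first.

    articles maps an article id to its text; non-matching articles are dropped.
    """
    p = vectorize(profile)
    scored = [(cosine(p, vectorize(text)), aid) for aid, text in articles.items()]
    ranked = sorted(scored, key=lambda s: (-s[0], s[1]))
    return [aid for score, aid in ranked if score > 0]
```

Running this once per subscriber profile over the day's articles gives each subscriber a separately prioritized list, which is the behaviour the abstract describes.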

COMPANY NAMES: Individual Inc. — Services 
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technology is under the covers to deliver what is essentially a 
unique newspaper for each subscriber, says Product Manager James 
Leightheiser. HeadsUp is a modular system with a growing string of low-tech 
486 PCs... 

...as input processors for the tens of thousands of articles, press 
releases, news items, and other reports generated by over 700 news 
sources. After selecting the items, another 486 PC then parses the items, 
ensuring everything is abstracted and presented uniformly. The 
recognition software has intelligence to take full text items and 
automatically generate abstracts. For example, the software understands 
how a Business Wire item is constructed and uses that information to 
create a meaningful summary. Similarly, the software recognizes items from 
different magazines, news wires, and on-line services... 
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NEW INTERCHANGE STANDARD TO HELP CONNECT INFO ISLANDS 



department within an organization. DMA-compliant systems, 

...to be confused with AIIM's ODMA standard - the Open Document Management 
API. ODMA lets standard desktop applications such as word processing 
or spreadsheet programs connect to document repositories. The 
standard makes it easier for users to access documents without regard 
for... 

...standard known as WebDAV. 

Rochester, N.Y.-based Xerox now is selling a software 
developer's kit engineered to let vendors build DMA-compliant systems. 
The toolkit includes middleware, a client and a repository. 
Xerox... 
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M2 PRESSWIRE-12 January 1999-VERITY: Verity solution chosen by Xerox for 
corporate intranet portal (C) 1994-99 M2 COMMUNICATIONS LTD RDATE: 110199 * 
Verity to help Xerox index and retrieve documents on its corporate 
intranet Verity, Inc. (Nasdaq: VRTY), today announced that Xerox 
Corporation (NYSE: XRX) has chosen Verity's Information Server and Verity 
Spider to help Xerox users index, search and navigate all of the documents 
on Xerox's corporate intranet. Verity's comprehensive solution is designed 
to provide Xerox with a one-stop shopping intranet portal that allows 
employees to search, retrieve, classify and disseminate business-critical 
information from a single site. The Xerox portal incorporates the Verity 
Information Server and Verity Spider products for advanced document 
searching, indexing and retrieving. Verity's products, which will reside on 
Xerox's corporate communication site, are designed to enable Xerox users to 
conduct comprehensive searches of the company's global intranet. "We chose 
Verity because its products enable us to deliver an intranet portal to 
enhance our current knowledge management capabilities, providing the means 
to centrally locate information from more than eight thousand intranet Web 
sites distributed globally." said Christopher Noyes, manager of Internet 
services for Xerox Corporation. "Today Verity's search engine is embedded 
in our DocuShare product which we believe will provide a solid foundation 
for a corporate-wide standard searching capability in the future." In 
addition to technology, Verity is providing on-site consulting services to 
Xerox to ensure the portal's successful deployment. "We view the intranet 
portal as the second generation of the Internet because it allows 
organisations to gain better control of unmanaged information," said 
Anthony Bettencourt, senior vice president of worldwide sales and product 
marketing at Verity, Inc. "We are pleased to be working with Xerox to help 
the company not only index and retrieve its large number of documents, 
but also to provide a single search engine that in the long-term can offer 
multi-lingual support and a consistent interface for users to gain access 
to mission- critical information residing on the Xerox corporate intranet." 
Verity Information Server and Verity Spider Prove Capabilities Beyond 
Basic Search Verity Information Server and Verity Spider enable 
organisations to turn corporate intranets into powerful knowledge bases, 
making business information accessible and reusable across the enterprise. 
Used together, the products offer businesses a comprehensive and integrated 
solution, providing information in context. With capabilities far beyond 
basic search and retrieval, Verity's enhanced products address the key 
problems of corporate knowledge retrieval and content management, 
including: * Automated document classification, profiling and tagging, 
based on matching documents to the key concepts relevant to individual 
business organisations * Full text and metadata indexing * Search and 
retrieval (retrospective search) Easy to install and use, Verity 
Information Server includes support for the latest versions of commercial 
Web servers, including Netscape NES 3.51 and Microsoft IIS 4.0, and support 
for future Web servers from market leaders. Verity Spider offers 
organisations advanced indexing capabilities, handling documents with less 
interruption than its competitors. Today, organisations use both products 
to improve knowledge-intensive business operations in such areas as 
customer care, business intelligence, marketing, financial analysis, 
project management, e-commerce and on-line publishing. About Xerox Xerox 
corporation (NYSE: XRX) is the global leader in the document processing 
business, providing the widest array of products and consulting services in 
the industry. Xerox markets its publishing systems, copiers, printers, 
scanners, fax machines and document management software in more than 130 
countries. Xerox products and services are designed to help customers share 
their knowledge and master the flow of information from paper to electronic 
form and back again. About Verity Verity is a leading provider of 
enterprise knowledge retrieval solutions for corporate intranets, online 
publishers, e-commerce vendors and market-leading OEMs and ISVs. Verity's 
product suite enables organisations to turn corporate intranets into a 
powerful knowledge base by creating corporate portals, making business 
information accessible and reusable across the enterprise. Verity's 
leadership has been recognised by numerous organisations, most recently by 
Delphi Consulting, which gave Verity the Market Recognition Award based on 
a survey of 600 corporate users. Verity's comprehensive and integrated 
product family enables enterprise-wide document indexing, classification, 
search and retrieval, personalised information dissemination, and hybrid 
online and CD publishing - all from the same underlying Verity collection. 
Verity's KeyView products also enable viewing of source documents stored in 
more than 225 formats. Verity's products are used by more than 1,000 
corporations, government agencies, e-commerce sites, on-line service 
providers, Internet publishers and software developers worldwide. Verity 
alliances include Adobe Systems, AT&T, CNET, Cisco, Compaq, Dow Jones, 
Ernst & Young, Financial Times, IBM, MD Consult, NewsEdge Corporation, 
Informix, Lotus, NEC, Netscape Communications, SAP, Siemens Nixdorf, 
Sybase, Tandem and Time Warner's Pathfinder. This press release contains 
forward-looking statements relating to Verity and the expected performance 
of Verity Information Server and Verity Spider products, as well as 
integration of its products into Xerox and customer environments. Other 
risks relating to Verity and its products are as set forth in its Form 10-K 
and Form 10-Qs as filed with the Securities and Exchange Commission. 
Verity, the Verity logo and Verity are registered trademarks or trademarks 
of Verity, Inc. All other trademarks are the property of their respective 
owners. CONTACT: Sarah Miles/Laura Mason, Marbles Tel: +44 (0)1491 411789 
e-mail: smiles@marbles.co.uk e-mail: lmason@marbles.co.uk Andrea Hawes, 
Verity, Inc Tel: +44 (0)1372 847604 e-mail: ahawes@verity.com WWW: 
http://www.verity.com *M2 COMMUNICATIONS DISCLAIMS ALL LIABILITY FOR 
INFORMATION PROVIDED WITHIN M2 PRESSWIRE. DATA SUPPLIED BY NAMED 
PARTY/PARTIES.* 
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...corporate intranet portal (C) 1994-99 M2 COMMUNICATIONS LTD RDATE: 110199 
* Verity to help Xerox index and retrieve documents on its corporate 
intranet Verity, Inc. (Nasdaq: VRTY), today announced that Xerox 
Corporation (NYSE: XRX) has chosen Verity's Information Server and Verity 
Spider to help Xerox users index, search and navigate all of the documents 
on Xerox's corporate intranet. Verity... 




...is designed to provide Xerox with a one-stop shopping intranet portal 
that allows employees to search, retrieve, classify and disseminate 
business-critical information from a single site. The Xerox portal 
incorporates the Verity Information Server and Verity Spider products for 
advanced document searching, indexing and retrieving. Verity's products, 
which will reside on Xerox's corporate... 

...Inc. "We are pleased to be working with Xerox to help the company not 
only index and retrieve its large number of documents, but also to 
provide a single search engine that in the long-term can offer 
multi-lingual support and a consistent interface for users to gain access 
to mission- critical information residing on the Xerox corporate intranet." 
Verity Information Server and Verity Spider Prove Capabilities Beyond 
Basic Search Verity Information Server and Verity Spider enable 
organisations to turn corporate intranets into powerful knowledge bases, 
making business information accessible and reusable across... 
...tagging, based on matching documents to the key concepts relevant to 
individual business organisations 
* Full text and metadata indexing 
* Search and retrieval (retrospective search) 

Easy to install and use, Verity Information Server includes support for... 

...3.51 and Microsoft IIS 4.0, and support for future Web servers from 
market leaders. Verity Spider offers organisations advanced indexing 
capabilities, handling documents with less interruption than its 
competitors. Today, organisations use both... 

...contains forward-looking statements relating to Verity and the expected 
performance of Verity Information Server and Verity Spider products, as 
well as integration of its products into Xerox and customer environments. 
Other risks relating to... 
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report building faster and easier while improving report 
functionality. The new Zanza MetaBuilder makes it easy to create and 
manage Zanza metadata, the database schema abstraction layer that 
simplifies query building, by providing more understandable business views 
of the underlying data. The MetaBuilder... 
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ABSTRACT: 

ICONOVEX of Bloomington, MN, has released Anchorpage, a software 
product that makes Web sites more user-friendly. Anchorpage allows Web 
site users to utilize key content and key phrases in searching the Web. 
Through ICONOVEX's new product, users can also compose presentation 
views, abstract views, phrase views and table of contents views. By 
selecting which views will be shown to web site users, web site 
administrators make searching easier for the users. Anchorpage also 
automatically searches databases and composes the views, eliminating the 
need for the site administrator to compose web navigation guides manually. 
PUBLISHER NAME: North American Publishing Company 
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ABSTRACT: 

...site users to utilize key content and key phrases in searching the Web. 
Through ICONOVEX's new product, users can also compose presentation 
views, abstract views, phrase views and table of contents views. By 
selecting which views will be shown to web site users, web site 
administrators make searching easier for the users. Anchorpage also 
automatically searches databases and composes the views, eliminating the 
need for the site administrator to compose web navigation... 



